Citations in the Digital Library of Classics: Extracting Canonical References by Using Conditional Random Fields
نویسندگان
چکیده
Scholars of Classics cite ancient texts by using abridged citations called canonical references. In the scholarly digital library, canonical references create a complex textile of links between ancient and modern sources reflecting the deep hypertextual nature of texts in this field. This paper aims to demonstrate the suitability of Conditional Random Fields (CRF) for extracting this particular kind of reference from unstructured texts in order to enhance the capabilities of navigating and aggregating scholarly electronic resources. In particular, we developed a parser which recognizes word level n-grams of a text as being canonical references by using a CRF model trained with both positive and negative examples.
منابع مشابه
Citation analysis of graduate Dental thesis references: Before and after an intervention
Background: Introduction of Iranian National Medical Digital Library (INLM) was a huge investment during several years ago. The aim of this study was to discover the effectiveness of this scientific intervention by examination of citation pattern among graduate dental thesis during before and after of INLM accessibility. Methods: This analytical study was conducted among all of graduate dental ...
متن کاملConditional Random Fields for Airborne Lidar Point Cloud Classification in Urban Area
Over the past decades, urban growth has been known as a worldwide phenomenon that includes widening process and expanding pattern. While the cities are changing rapidly, their quantitative analysis as well as decision making in urban planning can benefit from two-dimensional (2D) and three-dimensional (3D) digital models. The recent developments in imaging and non-imaging sensor technologies, s...
متن کاملCitation analysis of the articles published in Scientific and Research Journal of Oceanography
Background and aim: The scientific journals are a valid method for communication of update information and a link among various fields of science through citation. The aim of this study was to investigate the citation of the articles of 28 issues published in Scientific and Research Journal of Oceanography (JOC). Material and methods: This study investigated the citation of 290 articles publish...
متن کاملA citation analysis of specialty dissertations in Hormozgan University of medical sciences
Introduction: Citation analysis is a branch of bibliometrics in which information needs of users of a particular library can be assessed and therefore it can be used as a tool in a library collection building. This study was conducted on cited references of specialty dissertations in order to determine the reference type, their half life and language. Methods: Citation analysis on all the 55 ...
متن کاملAnnotated Bibliographical Reference Corpora in Digital Humanities
In this paper, we present new bibliographical reference corpora in digital humanities (DH) that have been developed under a research project, Robust and Language Independent Machine Learning Approaches for Automatic Annotation of Bibliographical References in DH Books supported by Google Digital Humanities Research Awards. The main target is the bibliographical references in the articles of Rev...
متن کامل